Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach

نویسندگان

چکیده

Abstract The interpretation of feature importance in machine learning models is challenging when features are dependent. Permutation (PFI) ignores such dependencies, which can cause misleading interpretations due to extrapolation. A possible remedy more advanced conditional PFI approaches that enable the assessment on all other features. Due this shift perspective and order correct interpretations, it beneficial if conditioning transparent comprehensible. In paper, we propose a new sampling mechanism for distribution based permutations subgroups. As these subgroups constructed using tree-based methods as transformation trees, becomes inherently interpretable. This not only provides simple effective estimator PFI, but also local estimates within addition, apply approach partial dependence plots, popular method describing effects suffer from extrapolation dependent interactions present model. simulations real-world application, demonstrate advantages subgroup over existing methods: It allows compute true data than proposals enables fine-grained

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimizing Program Performance via Similarity, Using a Feature-Agnostic Approach

This work proposes a new technique for performance evaluation to predict performance of parallel programs across diverse and complex systems. In this work the term system is comprehensive of the hardware organization, the development and execution environment. The proposed technique considers the collection of completion times for some pairs (program, system) and constructs an empirical model t...

متن کامل

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...

متن کامل

Determining Effective Features for Face Detection Using a Hybrid Feature Approach

Detecting faces in cluttered backgrounds and real world has remained as an unsolved problem yet. In this paper, by using composition of some kind of independent features and one of the most common appearance based approaches, and multilayered perceptron (MLP) neural networks, not only some questions have been answered, but also the designed system achieved better performance rather than the pre...

متن کامل

Developing a Model for Organizing Substance-Dependent Homeless and Street Women in Kermanshah: A Qualitative Approach with Grounded Theory

Objective: The aim of this study was to develop a model for organizing substance-dependent homeless and street women in Kermanshah. Method: The present study was conducted using the grounded theory method. The study population included all substance-dependent homeless and street women in Kermanshah. 37 people were selected by purposive sampling method. The data were analyzed by coding method. R...

متن کامل

the effects of speech rate,prosodic features, and blurred speech on iranian efl learners listening comprehension

کلید واژه ها به زبان انگلیسی: effect of speech rate on listening comprehension, blurred speech,segmental and suprasegmental features,authentic speech,intelligibility, discrimination, omission, assimilation چکیده: سرعت مطالب شنیداری در کلام پیوسته بطور کلی همواره کابوسی بوده برای یادگیرنده های زبان دوم و بالاخص برای شنوندگان ایرانی. علی رغم عقل سلیم که کلام با سرعت کندتری فعالیتهای درک مطلب شن...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data Mining and Knowledge Discovery

سال: 2023

ISSN: ['1573-756X', '1384-5810']

DOI: https://doi.org/10.1007/s10618-022-00901-9